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[57] ABSTRACT 

Pixel-based image data are subsampled for transmission to 
a printer and later bilinear interpolation in the printer. The 
transmission system evaluates the error that will remain after 
interpolation. In response to this evaluation, the transmission 
system also forms trim data for later application to the 
interpolated subsample to remove a significant part of the 
error. Preferably trim data points are selected based on 
relative importance of the associated correction — 
specifically, size of the error that will remain after 
interpolation — but also the amount of trim data is controlled 
to maintain a significant advantage in overall transmission 
time, relative to transmitting the entire image-data set. The 
amount of trim data is held roughly to a certain fraction, 
about five percent, of the number of points in each data block 
(e. g., swath), essentially independent of the error evalua- 
tion. Preferably this controlled amount of trim data is padded 
out with zeroes to form a complete trim-data array — which 
is then run-length encoded. The encoded trim data and the 
subsample are sent to the printer. (Preferably pure-text 
portions are sent separately.) The printer applies the trim 
data to the interpolated subsample to obtain a better approxi- 
mation to the original image data. 

15 Claims, 9 Drawing Sheets 
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SYSTEM OF DATA COMPRESSION BY 
SUBSAMPLING, AND PREFERABLY 

ADAPTIVE TRIM ACCOUNTING, FOR 
SENDING IMAGES TO A PRINTER 

RELATED PATENT DOCUMENT 

A related document is U.S. utility-patent application Ser. 
No. 08/644,808 filed in the United States Patent and Trade- 
mark Office on May 10, 1996, in the name of Jae Noh, It is 
hereby incorporated by reference in its entirety into this 
document. That document deals with methods for recogniz- 
ing and visually enhancing text within an image that is being 
read in for printing. 

FIELD OF THE INVENTION 

This invention relates generally to machines and proce- 
dures for printing color pictures, graphics or text, or com- 
binations of these, on printing media such as paper, trans- 
parency stock, or other glossy media; and more particularly 
to apparatus and method that construct such images from 
individual ink spots created on a printing medium, in a 
two-dimensional pixel array which is derived from an 
image-data array developed in a host computer. 

BACKGROUND OF THE INVENTION 

The invention is of particular use in such apparatus and 
method that aim for high fidelity in reproduction of colors 
and fine image details. Such approaches strive for near- 
photographic quality, by maintaining multilevel digital rep- 
resentation of the visual elements all the way to the final 
processes in the printer. 

(a) Colorimetric dynamic range — Some such systems and 
techniques may transmit to a printer multilevel red, green 
and blue (RGB) colored-light clata representing each pixel of 
an original image, for processing (usually called 
"rendition") within the printer in preparation for either 
binary or multilevel color printing and by any one of a 
variety of approaches. For instance each of the three colors 
might be represented on an eight-bit digital scale, 256 levels 
(starting with zero) for each color, coming to a total of 
twenty-four bits at each pixel. This is the arrangement which 
we currently prefer. 

Other such systems and techniques might entail transmis- 
sion of multilevel data for three chromatic colorants cus- 
tomarily cyan, magenta and yellow plus black (CMYK), at 
each pixel — each of the four colors being represented on a 
six-bit digital scale, sixty-four levels for each color, again 
coming to twenty-four bits at each pixel. In this case, 
whatever rendition is desired may be performed before 
transmission of data to the printer, and the data which are 
transmitted are considered essentially ready to apply in the 
actual printing process. Techniques of this type, while we do 
not currently prefer them, can be practical and are within the 
scope of the present invention. 

In conjunction with multiple data levels, such modem 
environments in computer-controlled printers typically use 
very high spatial resolution, for instance twelve to twenty- 
four pixels per millimeter (300 to 600 pixels per inch). Thus 
in both the colorimetric and spatial domains these printing 
environments effectively approach the capabilities of the 
photographic process. 

For purposes of discussion we shall use the lower of these 
two resolution levels just mentioned. The data density 
corresponding to such performance, for a standardize sheet 
of printing medium, taking the pictorial image area as about 
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203 by 254 millimeters (eight by ten inches), comes to about 
24xl2 2 x203x254 (or 24x300 2 x8x 10) -about 173 million 
bits. 

(b) Data transmission rates and times — Another part of 
5 the printing environments under discussion, however, is the 

data source: most commonly a personal computer, and 
nowadays one operating with the currently popular Win- 
dows® operating system. In the versions of that operating 
system that are now in greatest use, the average parallel data 

10 throughput to a printer is roughly 30 thousand Bytes per 
second (each Byte being eight bits), or 0.24 million bits per 
second, coming to some 15 million bits per minute. 

Such printing environments tend to severely limit the 
overall printing throughput. Transmitting a full image on the 

15 basis of these figures requires 173-15=11 Vi minutes. 

In the highly competitive modern world of computer- 
controlled high-quality color printing, even for a relatively 
economical printer in the range of perhaps $400 to $600, this 

2Q would be unacceptably slow. The desired or acceptable 
page-throughput range, depending upon price and 
application, may be from perhaps one to five minutes — thus 
calling for improvement by a factor of some 2 X A to 1VA. 
In one current product development of the Hewlett Pack- 

25 ard Company, a design objective is four-minute page 
throughput — requiring an improvement by a factor of about 
three. As more users turn to faster operating systems such as 
Windows 95® and Windows®, typical transmission speeds 
may rise, but probably not as rapidly as will consumer 

3P expectations for throughput. A likely result will be a com- 
mercial requirement for improvement by factors exceeding 
three. 

(c) Data compression — A well-known way to reduce data 
transmission times is to somehow encode the data to lower 

35 "the~am"ount of data bits actually passing from one apparatus 
to another. Among the most common methods is run-length 
encoding (RLE), which transmits the number of like char- 
acters in an uninterrupted row (or "run") and identifies the 
particular character which is subject to that run. 

40 In a typically complex pictorial image, the number of 
characters in a run may sometimes be great enough to 
produce a compression factor exceeding two, but cannot be 
relied upon to produce such an improvement. Whatever 
technique is used to garner an improvement must be re a - 

45 sonably reliable, consistent, so that the throughput can be 
stated and advertised. 

Among other algorithm types currently available for 
image data are those known as JPEG, chrominance 
subsampling, Huffman encoding, and Lempel-Ziv-Welch 

50 (LZW) compression. Of these, the more simple types cannot 
reliably produce the needed factors of improvement. 

The others require relatively complicated processing, 
which in turn is inconsistent with the economical selling- 

55 price range mentioned above. In the JPEG method, for 
example, frequency content of the data are analyzed; while 
not as costly as full Fourier analysis, this method is still too 
expensive. 

Chrominance subsampling requires computations in the 
so chroma domain — and for this purpose light signals (RGB) or 
colorant signals (CMYK) must be first converted to some 
variable related to chroma. This is a relatively costly and 
time-consuming proposition. 
Huffman encoding is described by W. Pennebaker and J. 
65 Mitchell in their book JPEG Still Image Data Compression 
Standard, as follows. The Huffman technique belongs to the 
class of entropy coding and involves statistical analysis. For 
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a set of symbols to be coded, the most probable one will (d) Conclusion — The above -discussed limitations on data 
have the shortest code, and the least likely will have the transfer, in relation to the size of modern data files for 
longest. Here is an example: direction to computer-controlled modern printers, have con- 

tinued to impede achievement of uniformly excellent inkjet 
printing — at high throughput. Thus important aspects of the 
technology used in the field of the invention remain ame- 
nable to useful refinement. 

SUMMARY OF THE DISCLOSURE 

The present invention introduces such refinement. A sat- 
isfactory solution requires a careful balance between algo- 

The code used in the "simple" column has two bits per rithmic complexity and compression efficiency, 

possibility, independent of the likelihood of having to use What complexity is permitted should be pushed into the 

the code. The "better" column transmits a single bit for the host computer, allowing for fast and simple hardware 

eventuality that occurs in the great majority of cases 15 decompression at the target— in the printer. This objective, 

(preferably a zero for optimum use of RLE encoding and for howcvcVj must not be pursued to the point of significant 

discrimination against all the other codes, that begin with adver5C per f ormance impac t 0 n the host (in other words to 

arise' less hTZti 0CCaS1 ° nS the P0int 0f ^'ykg its welcome). The present invention 

^^e^suSnumber of bits per symbol, on the average, on ^^^^ these constraints as well as the other considerations 

is only 3 /4xl+ I / 8 x2+2x 1 /i6x3=iy 8 — which is an improvement 20 discussecl at,ove * 

of about thirty percent, relative to two bits for the column In its preferred embodiments, the present invention has 

headed "simple". The computational overhead of such a several aspects or facets that can be used independently, 

system for our present purposes, however, would be unac- although they are preferably employed together to optimize 

ceptable. their benefits. With respect to a first of these principal 

LZW belongs to the category of so-called "dictionary 25 aspects, the invention exists within a system for transferring 

schemes". It works by building a dictionary of symbols as pixel -based image data from an image-data source into a 

the date are being treated. With a long enough data set, a printer and for employing those data in the printer, 

good dictionary-building scheme, and long repetitive Most commonly such a system includes portions of a host 

symbols, this type of approach works well— mainly due to computer, and of a printer, and of software called a printer 

the fact that sending the reference location of a symbol in the 30 dtiycf , that opcfatcs , he computcrf and of hardwarc . 

dictionary is more efficient than sending the symbol itself. cmbedded firmware that operates the printen (0th er 

The patented Welch form ^o ^ trus method is widely used. forms of imp i cm cntation are within the scope of the 

Both the Huffman and LZW methods are classed as invention) 

"lossless" It has been shown that Jossless compression- ^ ^ thin such a " system the present inventive combi- 

methods when applied to natural images usually can achieve 35 „ ot - lfc .„ir „ -ii k _ m ,„ l & „ ,u 

i c t c i . <-» i i i_ . „, . nation itself, as will be seen, may be part of the computer 

only rauos of L5:l to 2:1. Examples can be seen in Wein- and/of ^ ^ of {i b > ^ conjunction with 

Compression Me hod ISO/ffiC JTC 1, SC 29, WG1, invemion correspond t0 the fct major t or facet 

document N203 (Jul. 10, 1995). discussed below 

More to the present point, LZW methods — like 40 XT - * , . r L . ^ e . e 
Huffman's^ntail substantial algorithmic overhead for Now " 'P referred embodiments of this .first of its facets or 
building of the dictionary. LZW may also be awkward to as P ects ' ? c mventive combination includes some means for 
implement in real time, or without using large amounts of ^bsamphng an entire set of image data, to form a subsample 
upstream storage capability. for ^mission and later interpolation. For purposes of 
Most image-compression algorithms attempt to solve the 45 8»^y and breadth m discussing the invention, these 
problem of data-array size, whereas the problem at hand meanS Wl11 be caUed Sun ^ the "Sampling means", 
really is one of transmission time. As suggested in the The combmation also includes some means for evaluating 
preceding paragraphs, algorithmic complexity may influ- crror that Wl11 remain u P° n interpolating the subsample— i. 
ence cost and also may itself constrain transmission time: c * a rcsidual error, equal to the differences between the 
computation time for compression and decompression 50 on S inal P oints and mc interpolated subsample. These 
stages is a factor in overall throughput. means, again for generality, will be called the "error- 
Compression typically is performed in an already-existing evaluating means", 
relatively fast computing environment, at a point in the Further the combination includes some means for forming 
overall system that is typically shared with many other trim data— again, the "trim-data forming means". The trim- 
computing processes. Therefore the overall result may not 55 data Arming means are responsive to the error-evaluating 
be extremely sensitive to complexity in the compression means. 

stage. The trim data formed by these means are for application 

Decompression, however, must occur in the printer. Here, t0 trjc interpolated subsample, to remove a significant part of 

designing in a requirement for rapid complex computation, mc error. In other words, the trim data refine the interpolated 

at a point which is dedicated to the printer itself, translates 60 subsample so that the refined version is more closely related 

into prohibitive cost. to the original data. In accordance with the invention, 

Additionally, the "photographic" quality output of the however, ordinarily the trim data when applied to the 

current generation of color printer products demands that interpolated subsample will not remove all of the error, 

any image manipulation be visually lossless. This consid- The combination additionally includes some means for 

eration militates against some of the familiar techniques for 65 transmitting, to the printer, the subsample and the trim data, 

data compression, which sometimes tend to create percep- The foregoing may constitute a description or definition 

tible shifts in visual quality. of the first facet of the invention in its broadest or most 
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general form. Even in this general form, however, it can be substantially constant fraction of the number of points in the 

seen that this aspect of the invention significantly mitigates entire image-data set, substantially independent of the error- 

the difficulties left unresolved in the art. evaluating means. (As mentioned above, the "amount of 

In particular, the invention provides a family of very trim data" preferably excludes the omitted points for the 
simple compression and decompression procedures that 5 pixels that are in the subsample.) We prefer to set that 
achieve the relatively low compression factors required, and fraction to approximately five percent, 
do so with minimal processing overhead. Within this range In another important alternative, or variant, we prefer that 
of compression factors, the parameters of the procedure are the trim-data forming means instead control the amount of 
also easily scaled to accommodate various specific con- trim data for defined portions of the subsample, at a sub- 
strains of throughput and image size. i° stantially constant fraction of the number of points in each 

Thereby, accuracy of the trim data (and so overall trans- of the defined portions. As an example, this variant is 

mission accuracy of the link) can be selected in a generally especially useful when the invention is for use with a printer 

direct relation with available throughput, or inverse relation that constructs an image on a printing medium by scanning 

with desired compression factor — i. e., with image size. of a multielement printhead across the medium to form 

Many strategies for constacting the trim data are possible 15 swaths of dots. 

within the scope of the invention. In this case each of the "defined portions" is advanta- 
Although this first aspect of the invention in its broad geously a swath of dots; and the trim -data forming means 
form thus represents a significant advance in the art, it is control the amount of trim data for each swath at a substan- 
preferably practiced in conjunction with certain other fea- tially constant fraction of the number of points in the swath, 
tures or characteristics that further enhance enjoyment of 20 In other words, if the printing system operates on a swath- 
overall benefits. For example, it is preferred that the sub- wise basis it can be especially effective and simple to design 
sampling means subsample the entire image-data set sub- the compression system to operate in the same way — 
stantially periodically within the entire set — or within compressing, transmitting, and decompressing each swath in 
defined portions of the entire set, as for instance within turn, just preliminary to printing. For this paradigm too, the 
swaths of a scanning swath printhead. 25 preferable fraction is roughly five percent. 

It is also preferred that the trim-data forming means It is also preferred that the first facet of the invention 

control the amount of trim data, in such a way as to maintain include some means for transmitting pure-text portions of an 

a significant advantage in overall transmission time of the image separately from the subsample and the trim data. This 

subsample and the trim data — relative to transmitting the 3Q preference arises from the fact that in dealing with text at 

entire image-data set. In other words the amount of trim data certain font sizes the invention, while usable, tends to 

is advantageously controlled dynamically, or adaptively, produce image quality that is slightly less than ideal, 

through calculations working backward from the desired It ^ preferred that the invention include means for 

performance. storage of automated instructions for operation- of all of the 

AJso preferably the trim-data forming means.select trim- 35 foregoing means. Correspondingly the invention preferably 

-data points for use based upon the relative importance of the includes means for retrieval of the instructions from storage 

corresponding correction. In this regard we prefer, more anc j application of the retrieved instructions to operate all of 

specifically, that the trim-data forming means select trim- those foregoing means 

data points for use based upon the size of the error that wiU „ fa a]so ferred ^ ^ further 

remain upon interpolating the subsample. 4Q indude> ^ (he prin(ef) comtpm ^ Bg provisions for inter- 

Still another preference is that the trim-data forming acting ^ the aforementioned elements of the sending 

means control the amount of tnm data for a data block, at a S y Stem . Such provisions are discussed below in relation to 

substantially constant fraction of the number of points in , he second major aspect or facet of me invention, 

said data block, substantially independent of the error- T f . u j- . c r-. 

evaluating means. Here we mean the phrase "data block" in « „ In 6ffl bodiment 5 of a second of its major aspects 

a general way-to encompass either (1) the entire image- ,hc mv ™* • system for receiving and using pixel-based 

data set or (2) one of a group of defined portions of that ma f dat » b A om m source, m a printer. This 

entire imace-data set system includes some means for receiving a subsample of an 

„ Ti ( . . * , ... entire set of image data. 

When the trim-data forming means select data points, it is . 

further preferred that these means further include some 50 e ^ also includes means for mterpolatmg the subsample, to 

means for padding out the controlled amount of trim data ^ Tm an approximation of the entire set of image data; means 

with zeroes to form a complete trim-data array. (In one sense for «?™°B tnm data; and means for applying the trim data 

the array is not completely complete: we prefer not to to the interpolated subsample to remove a significant part of 

include any zeroes or other data for those particular pixels error 

which are in the subsample. Data for those pixels would be 55 As will now be appreciated, preferred embodiments of 

redundant, and the printer is easily taught to skip over those this second main facet of the invention also include means 

pixels when applying the trim data.) We also prefer to for applying the trimmed interpolated subsample to control 

include further means for run-length encoding the padded- printing of a corresponding image by the printer, 

out trim-data array. The foregoing may constitute a description or definition 

Thus the first aspect of our invention in effect transforms so of the second facet of the invention in its broadest or most 

the data format into one which is amenable to this familiar, general form. Even in this general form, however, it can be 

simple and effective RLE technique. The initial full image- seen that this aspect of the invention too significantly 

array data format, as mentioned earlier, is incompatible with mitigates the difficulties left unresolved in the art. 

such handling. In particular, as to this aspect of the invention we focus 

Moreover in one alternative form of this first main aspect 65 upon the notably simple and fast portions of the overall 

of the invention, it is preferable that the trim-data forming procedure that are left to be done by the printer firmware, 

means control (i. e. hold) the amount of the trim data at a The printer simply: 
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receives two data streams, consideration of the following detailed description, with 

accumulates them into two predefined data blocks, reference to the appended drawings, of which: 
expands one by interpolation and the other by run-length 

decoding (RLD) so that they are both identical in size BRIEF DESCRIPTION OF THE DRAWINGS 

(full-size arrays) for the particular data blocks, 5 rj „ i . .iij* t - 4i - c 

, , . . f . FIG. 1 is a block-diagrammatic representation or. a 

adds the two arrays, pomt for point, and transmission-stage system (i. e.. most typically a computer 

V H\T 1 1" 01 array i convB J ,tl ?"? n ?- . ., . A with printer driver), and also of processes within that 

Although this second aspect of the invention in its broad atus> accorfin to &c 

form thus represents a significant advance in the art, it is rr & 

preferably practiced in conjunction with certain other fea- 10 FIG * 2 fe a ^ representation of a receiving-stage system, 

tures or characteristics that further enhance enjoyment of e * printer, and also of the processes which the printer 

overall benefits. performs; 

For example, it is preferred that the system further include FIGS. 3 are a schematic representation of data flow within 

some means for storage of automated instructions for opera- the apparatus of FIGS. 1 and 2, for an arbitrarily constructed 

tion of all of the foregoing means; and corresponding means 15 example of a portion of an image — and in particular: 

for retrieval of the instructions from storage and application u 4U f . , „ „ . . . . A . 

* . . j • . . , ,? C(U ^ FIG. 3a is a hypothetical very small original image data 

of the retrieved instructions to operate all of those means. . . i r <i_ t-t^ , , . r 

. , r i .t . .* • . i . «. * . array, HG. 36 is a subsample ol the HG. 3a data, tor a 

It is also preferred that the interpolating means interpolate * ,. Ci 

bilinearly. (As suggested in discussion of the first main subsampbng ratio of two, 

aspect, we also prefer that the printer automatically skip past FIG. 3c is a data array derived by bilinear interpolation 

the pixels which are in the transmitted subsample, when 20 from the FIG. 36 subsample (with some additional assumed 

applying the trim data. This skipping function can be accom- data at certain edges), 

plished in any of a great variety of ways, as will be clear to FIG. 3d is an error or residual array derived by compari- 

people skilled in this field.) son of FIGS 3a 3c, 

In a third basic aspect or facet, the invention is a method ■» • * ■ j * . . 

of transferring image data into a printer. This method is for 25 .. FIG _^ 1S > tnm . data m * denved by applymg a thresh " 

printing of a corresponding image on a printing medium, by to ' 

construction from individual marks formed in pixel column- FIG. 3/ is a representation of one version of a run-length 

and-row arrays by a scanning multiple- nozzle pen. The pen encoded form of the FIG. 3e trim-data array; and 

operates in conjunction with a printing-medium advance FIGS. 4a through 4c are histograms and corresponding 

mechanism. 30 graphs of various error levels occurring in three represen- 

The method includes the steps of subsampling an entire tative stock photos, 
set of the image data, to form a subsample for transmission 

and interpolation; and evaluating error that will remain upon DETAILED DESCRIPTION OF THE 

interpolating the subsample. The method also includes the . . -- PREFERRED EMBODIMENTS 

- step of forming trim data for application to the interpolated 35 1. THE TRANSMISSION STAGE GENERALLY 

subsample to remove a significant part of that error. This step Data from the original raster image 11 (FIG. 1), for a 

is responsive to the error-evaluating means. pictorial presentation to be printed, passes along two paths 

In addition the method includes the step of transmitting 12, 22 to separate apparatus (and procedure) modules, 

the subsample and the trim data from the source to the (a) Subsampling — One of these modules is the fundamen- 

printer. Further steps of the method include storing, in 40 tal subsampling block 13, in which the amount of data along 

machine-readable form, automated instructions for all the each direction in the data array is reduced systematically by 

stated steps; and retrieving and implementing the automated a factor n to produce a subsample 14. As the subsampling is 

instructions to compress and transmit data. bilinear, by the factor n in each direction, the overall 

The foregoing may represent a definition or description of reduction in mass of data is by the factor n 2 . 
the third main facet of the invention in its most general or 45 We currently prefer to use n=2, so that the total reduction 
broad form. Even as thus broadly couched, this aspect of the in amount of data is to one-quarter of the original image- 
invention can be seen to importantly advance the art of array size. With the target compression of 3:1 in mind, this 
high-quality color printing. In particular the benefits dis- figure leaves the equivalent of one-twelfth of the original 
cussed above in regard to the first main aspect of the array size (16-VioVk) for trim data — including any neces- 
invention are enjoyed here in the context of an operating 50 sary encoding of that data. 

method. This smaller data array 14, too, is used in two ways. In the 

Nevertheless we prefer to practice this third aspect of the first, it passes entirely out of the transmission system of FIG. 

invention in conjunction with certain characteristics or fea- 1 as an output signal 14' to the printer, where it can be seen 

tures that optimize enjoyment of its benefits. For example it (FIG. 2) as the primary input signal, 

is preferred that the trim-data forming step include control- 55 (b) Interpolation — In the printer, this received image array 

ling the amount of trim data to maintain a significant 14' will be necessarily interpolated 31 to provide the full 

advantage in overall transmission time of the subsample and complement of data needed for printing the image. The data 

the trim data, relative to transmitting the entire image-data array 32 could be used in the printer without further 

set. processing, to print the corresponding image — but would be 

It is also preferable that the method further include — in 60 subject to errors, 

the printer — the steps of receiving the subsample and the These errors, for a typical image, would be noticeable in 

trim data, interpolating the subsample, applying the trim only a relatively small fraction of the picture — typically 

data to refine the interpolated subsample, and applying the some five percent to at most fifteen percent of the whole 

refined interpolated subsample to control printing of the image area. Nevertheless we assume that to a critical eye 

corresponding image. 65 they would be unacceptable. A philosophy of the present 

All of the foregoing operational principles and advantages invention is to try to supply to the printer, in essence, just 

of the present invention will be more fully appreciated upon enough information to correct those of the errors that would 
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in fact be noticeable (or most noticeable), and to disregard 
the rest of the errors. 

Now reverting to discussion of the transmission stage the 
reduced image array 14 (FIG. 1) also passes along another 
path 14" to a bilinear interpolation block 15. This module 15 
mimics or duplicates the operation of the necessary inter- 
polation block 31 (FIG. 2) in the receiving printer. 

The resulting interpolated array 16 is substantially iden- 
tical to the interpolated array 32 produced in the printer, in 
response to the transmitted signal 14'. This shadow interpo- 
lated array 16, in the transmission stage, will serve to (1) flag 
the noticeable errors mentioned above, and (2) supply data 
necessary to generating the correction information called for 
by the philosophy of the invention. 

(c) Residual-error determination — For this purpose the 
duplicate array 16 is compared 23 with the original image, 
received along the second path 22 mentioned at the begin- 
ning of this discussion. The comparison module 23 operates 
by subtracting the duplicate interpolated array 16 from the 
original data received 22 — or, as suggested in FIG. 1, by 
inverting the algebraic sign of the duplicate array 16 and 
then adding the inverted array 16 to the original-data infor- 
mation 22. 

The result of this subtraction 23 is an array of "residual" 
errors 24 — errors as between the interpolated data array 16 
or 32 and the original image. In practice such errors arise 
whenever the assumption, e. g. of linearity, that underlies the 
interpolation process fails. 

(In the remainder of this document we shall assume for 
purposes of simplicity in discussion that the interpolation 
processes 15, 31 performed in the transmitting and receiving 
stages are substantially the same — and specifically that both 
are linear in the two axes of a rectangular data array, as we 
prefer. Those skilled in the art will understand that-other- 
- interpolation techniques, e~ g. nonlinear, or strategically 
differing as between the two stages, can be substituted 
straightforwardly, and within the scope of the invention.) 

To the extent that the behavior of the image happens to be 
linear in the regions which are between image points that are 
retained in the subsample 14, the residual 24 will always be 
zero and call for no correction. Of course by definition no 
correction is ever needed at the data points retained in the 
subsample 14. 

(d) Downsampling the residual — Now it will be plain that 
the residual 24 describes the correction needed to produce a 
perfect image — i. e., to trim up the subsample 14, 14' to 
make it exactly the same as the original image 11, If all this 
information were transmitted to and suitably applied by the 
printer, however, while the resulting printed image would be 
"perfect" the total transmission time for these correction 
data alone could in principle be as long as it would have 
taken to simply transmit the entire original image 11. 

Hence the overall result, with transmission of the entire 
residual array 24, would be highly counterproductive. What 
is needed is a strategy for intelligently selecting some subset 
of what is needed to product a perfect image — namely, a 
trim-data subset that yields an image which is perceptually 
indistinguishable, or nearly so, from a perfect one. 

Perhaps in purest principle almost any selection criterion 
would be helpful. For example, if the first quarter of the 
residual array 24 were used and the remainder discarded, 
then the first quarter of the reproduced overall image would 
be significantly improved — so on average the overall image 
would be somehow better. 

The resulting inconsistency and discontinuity in quality, 
however, might be conspicuous — and furthermore this tech- 
nique is equivalent, in terms of information content, to 
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simply transmitting the entire first quarter of the original 
image and then subsampling only the rest. Therefore this 
strategy seems less than ideal. 

It may appear that a better strategy would be simply to 

5 systematically subsample the residual array, as this would at 
least distribute the improvement in a rational way through- 
out the image — but again this is informationally equivalent 
to merely a more-elaborate subsampling of the original 
image. Little or nothing is gained, and the additional com- 

10 putation 15, 23, 25, 27 becomes apparently only waste 
overhead. 

What is desired is a strategy that provides correction 
where correction is, in some sense, most needed. The 
apparatus of our invention indeed does exactly this — it 

15 implements a smart algorithm which in fact intelligently 
selects that very subset of the residual data which makes 
most effective possible use of the added computational 
resources and transmission time. 
We have already pointed out that the residual array 24 

20 describes the correction needed to produce a perfect image. 
In accordance with our invention, however, we consider it 
useful to recognize that the information in the residual data 
array 24 actually can be segregated into two different types 
of information, and these in turn can be utilized in two 

25 different system blocks (or process stages) respectively, 
(e) Using the residual itself to guide the downsampling — 
Thus first the magnitude of the residual 24 at each pixel is 
an indicator of departure from linearity, and therefore of the 
need for correction, at that pixel. Then secondly the mag- 

30 nitude and algebraic sign of the residual 24, considered 
together, provide a quantitative indication of the amount of 
correction or trimming -up that is needed at that pixel. 

Our invention accordingly first provides an evaluation, 
block 25 to assess the need for correction at each pixel, 

35 based upon the magnitude of the residual or error 24 at each 
pixel; and thereby selects, to be -transmitted, correction 
information 26 only for pixels where correction is most 
needed. In this assessment and selection module 25, the 
residual 24 is simply compared with a threshold value. 

40 (f) Candidate criteria for thresholding — As mentioned 
earlier, we have definite preferences concerning the manner 
of selecting this threshold. It is important to recognize, 
however, that our invention as most broadly couched 
encompasses a variety of different threshold selection pro- 

45 cesses or criteria. 

For example, the threshold may be selected arbitrarily; or 
it may better be selected as a so-called "educated guess" — a 
number that seems, based upon personal observation and 
analysis of a great amount of real-world image data, to lead 

50 to ignoring inconsequential errors and correcting those that 
produce visually noticeable shifts in images. Such an edu- 
cated guess might be made in terms of an absolute value, or 
in terms of a fraction of the signal values in the uninterpo- 
lated adjacent data points — or yet again in terms of more- 

55 complex functions of nearby signal values. 

Our strong preference is for selection of the threshold, for 
each part of the image, based on a very different kind of 
educated guess and to accomplish a somewhat more subtle 
goal. Before describing that selection process and its 

60 rationale, however, to put that more-sophisticated strategy 
into a meaningful perspective we will first complete our 
discussion of the transmission and reception processes based 
on the simple idea that some threshold is established and 
applied in the evaluation module 25. (We do propose the 

65 understanding, however, that the threshold is selected so that 
only some rather small fraction of the residual data exceeds 
the threshold.) 
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When this is done, some subset 26 of the residualdata 
array 24 is defined for transmission to the printer, and for 
application within the printer as corrections to trim up the 
subsampled image 14'. All other points in the subsample 14' 
receive no correction, or to put it another way they receive 
a correction of zero magnitude. 

(g) Encoding the trim data for reassembly — The nonzero 
corrections must be somehow identified, or in other words 
associated, with the image-data points to which they should 
be applied. One way to do this, for instance, would be to 
transmit a data-array location, in association with each 
correction value. 

Since the number of nonzero corrections is preferably a 
small fraction of the array size (columns multiplied by 
rows), only a small amount of location data would have to 
be transmitted. Using this approach for full-image 
transmission, some of the address sizes could be unwieldy; 
it might well be satisfactory, however, for transmission of 
only a swath at a time — since at least the number of rows 
requiring identification in each transmission batch would be 
small. 

We prefer a different way of identifying the nonzero 
corrections with their associated image-data points. We in 
effect construct a full-size array of trim data, i. e. an array 
roughly the same size as the original image array — but one 
in which all values are zero except those nonzero corrections 
selected by the evaluation or thesholding module 25. (We 
say "roughly" because, as mentioned earlier, those particular 
pixels which are included in the subsample are advanta- 
geously omitted from the trim data.) 

This roughly-full-size array, consisting (unlike the origi- 
nal image array) primarily of zeroes, is then ideally suited 
for the previously mentioned classical run-length encoding 
(RLE) module 27 and for transmission to the printer as an 
- RLE data stream 28. This encoding is rapid and inexpensive. 

While the subsample 14' and RLE stream 28 are dia- 
gramed for tutorial purposes as parallel, those skilled in the 
art will understand that it will commonly be most expedi- 
tious to transmit these two data streams 14', 28 in some sort 
of sequence. We prefer to transmit an entire block (e. g., a 
full swath) of the subsample 14' followed by all of the 
corresponding RLE trim data 28; however, many variant 
sequences are functionally equivalent and within the scope 
of the invention. 

2. THE RECEPTION (PRINTER) STAGE 

As previously mentioned the receiver preferably performs 
an interpolation 31, which is assumed to be bilinear and 
closely matching that 15 performed in the transmitter; and 
the resulting image 32 could be used without further 
processing, to print a page. Resulting image quality would 
be degraded in only a relatively few, usually isolated 
regions. 

According to our invention this degradation is removed, 
or very nearly so, through application of the trim data 28 
received preferably as an RLE bit stream. As mentioned 
earlier these data may be encoded in some other way that 
identifies the array locations where they should be applied. 

The trim data 28, however they were encoded, are now 
decoded correspondingly 33, to reconstitute the abbreviated 
residual 34 and so recover the thresholded data 26. (These so 
data differ from the full, prethreshold residual set 24 that was 
subjected to thresholding 25 in the transmitting stage.) 

Our preference for the use of an RLE process 27 at the 
transmission stage is in large part due to the extreme 
simplicity, economy, speed and reliability that can now be 65 
enjoyed in the decoding process 33. As can now be 
appreciated, our invention does indeed push the great bulk 
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of any processing complexity into the transmission stage, 
performed by the host computer; yet even that degree of 
complexity is not so great as to significantly impair opera- 
tion or throughput at the host. In particular generating the 
5 trim data stream 28 (apart from the subsampling process 13) 
requires only a single pass through the original data 11. 

The thresholded trim data 26, 34 which are thus recovered 
are primarily zeroes, with some fraction of nonzero correc- 
tion values. (As a practical matter these are well under half, 
10 as will be seen. All those values which correspond to 
positions within the subsample 14' are zero by definition, 
and so omitted from the trim data as mentioned previously.) 

Next the printer applies these trim data 34 to the inter- 
polated subsample 32, by simple addition 35, to produce a 
15 significantly closer approximation 36 to the original image 
11. Although labeled "reconstructed image" in FIG. 2, the 
reconstruction is not perfect and this image is "recon- 
structed" only to the extent that earlier, in the threshold 
process 25, none of the discarded correction points were in 
20 fact significant. 

The point labeled "reconstructed image" in FIG. 2 also 
represents final processing hardware (and associated 
firmware) 36 which does color correction if necessary, then 
preprinting rendition, and actual printing. In other words, 
25 just as FIG. 1 represents the host computer and its apparatus 
modules that perform the indicated steps, as mentioned in 
the preceding section of this document FIG. 2 represents the 
entire printer with its mechanical, fluidic and associated 
control modules 36 — i. e., printing heads, scanning 
30 apparatus, printing -medium advance subsystem, input and 
output trays for the printing medium, etc. 

To implement the decoder in the printer hardware imposes 
significant memory constraints. In fact, for our preferred 
— printing environment of a swath-scanning printer (for 
35 instance a therm al-inkjet printer), the printer itself is 
designed to handle image data on a per-swath basis — i. e., 
based on the amount of data that fills a space on the printing 
medium which is the full width of the image and the height 
of the printhead. (In the case of overlapping swaths, as for 
40 instance the use of three passes to complete inking of any 
given shallow strip across the printing medium, the refer- 
ence here should be understood to mean the amount of data 
that can be printed in one pass.) 

In a representative device this height is equivalent to 
45 sixty-four image lines, but can be higher (for example one 
hundred lines) or lower. Therefore, instead of storing the 
complete subsample 14 and the complete residual 24, and 
performing the decoding an the complete image, we prefer 
to optimize the implementation by operating the compres- 
50 sion algorithm on one swath at a time — meaning, typically, 
the amount of data printed in a single pass of the printhead, 
even though swaths may overlap. 

Although this implementtation is dictated or at least 
inspired by the decoder 33, as will be seen a similar benefit 
55 is felt at the encoder 27. The following section of this 
document explains that threshold decisions are also made 
independently for each swath. 

The image 36 can be printed conventionally, just as if the 
entire original data set 11 were present. 
3. ADAPTIVE THRESHOLDING 

It remains to explain our preferred choice of thresholding 
criteria mentioned in section 1(f) above. We have pointed 
out that any of a variety of criteria could effectuate major 
goals of our invention. 

One of these is the downsampling of the residual 24 to 
make its transmission practical. Another is the use of the 
magnitude of the residual 24 at each pixel as an indicator of 
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departure from linearity, and therefore of the need for 
correction, at that pixel. 

Our invention, however, is best practiced to optimize still 
another objective. In particular we have in mind the objec- 
tive of ensuring an approximately constant page throughput, 
particularly in view of — as an example — a current-day data- 
transfer rate that is roughly one-third of what is needed to 
transfer an entire image in a commercially acceptable prod- 
uct. 

Thus we wish to transfer enough information to print a 
perceptually accurate image, but take only about onethird of 
the time that would be needed to actually transmit the whole 
image to the printer. As will be understood from the earlier 
"BACKGROUND" section of this document, the figure 
"one -third" is a strong current preference derived from 
modern operating environments, but in the larger sense is 
merely exemplary. 

From the foregoing discussion it can be seen that we do 
not have enough time to transmit to the printer even half the 
original data, or indeed even one-third. If we took the time 
to transmit a third of the original data there would be no time 
left for transmitting the trim data 26, 28. 

Moreover, as to selection of a threshold, we wish to go 
further than simple selection of a threshold level that will 
reliably be high enough to guarantee a small enough amount 
of nonzero data to ensure our target (for example 3:1) overall 
compression. In addition we wish to be certain that we do 
not use a threshold that is too high, or an amount of nonzero 
data that is too small — since such usages unnecessarily 
compromise the quality of the eventual output image. In 
other words, the objective here is to select a threshold that 
is just high enough to yield a quantity of nonzero data that 
is just small enough to produce the target compression ratio 
and transmission time. 

Unfortunately, no single threshold value can produce 
these results for all or even most images. The required 
threshold for such idealized results will depend upon the 
details of data within each image. For each image, a different 
threshold is ideal. 

Our invention therefore preferably performs calculations 
working backward from the target compression ratio to find 
a threshold that will just produce, or very nearly so, the 
target ratio. These calculations require a known relationship 
between the threshold and the resulting numbers of nonzero 
values in the RLE stream (or equivalently the number of 
locations that must be addressed), and thus the overall length 
of the RLE stream. 

We have been unable to find such a relationship that is a 
priori — in other words, one that can be derived or found 
from first principles. Therefore we resort to an empirical 
relationship found from a large number of representative 
images. 

Nowadays such images can be obtained, for instance, as 
stock photos in CD-ROM catalogues. In FIGS. 4a through 
4c are data representative of a very great many such analyses 
which we have conducted on realistic image data, forming 
the basis of the empirical relation just mentioned — and 
advantageously incorporated into the practice of our inven- 
tion. 

We have demonstrated validity of this relationship not 
only for an image-data block size equal to a full image but 
also, relative to a swath-scanning printer, within the data 
block size of a representative swath — as, for instance, the 
sixty-four lines of our currently preferred scanning thermal- 
inkjet printing environment. 

Our research has shown that the amount of interpolation 
error in a bilinear subsample with n-fold reduction along 
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each axis of the array is typically quite small. The use of 
about five -percent nonzero trim data turns out happily to 
handle very effectively — in the great majority of cases — the 
data points at which interpolation error is significant. (In 

5 stating this value, we take into account the reduction in trim 
data that is obtained simply by omitting trim data for pixels 
that are in the subsample.) 

As can now be appreciated, the reception-stage operation 
is entirely independent (except for its numerical results) of 

id the particular type of thresholding module 25 that is 
employed in transmission. In other words, the printer pro- 
cessing (FIG. 2) will proceed through all the same steps 
whether the threshold used in transmission is selected 
arbitrarily, or based on one or another educated guess, or 

15 based on the adaptive or dynamic technique just described. 
This fact — that the decoder 33 is not required to have any 
knowledge of the thresholding scheme — makes the decod- 
ing algorithm very easy to implement. 
4. CASES IN WHICH THE INVENTION DOES NOT 

20 PERFORM WELL 

Due to the simplicity of the subsampling algorithm used 
in our invention, there are some limiting cases to its perfor- 
mance. One such case is a checkerboard pattern — 
alternating pixels being zero and full intensity (i. e., for an 

25 eight -bit dynamic range, 255). 

For such patterns the above-described modules that 
implement the algorithm (using n=2) break down, since 
interpolation between two zeroes always produces a large 
residual (namely 255) relative to the correct value of 255. 

30 Similarly, interpolation between two 255's always produces 
a large residual (namely -255) relative to the correct value 
of zero. 

In consequence the thresholding effort fails to produce 
any selection at alL(all errors -are large and none are zero), 

35 and the system cannot reduce the amount of trim data below 
the size of the full data array. The checkerboard pattern at 
pixel-grid periodicity, however, is not a typical photographic 
image and therefore failure of the invention to deal with such 
a pattern is not of great concern. 

40 Another Limiting case is black text. With its many sharp 
transitions, an image of black text imposes severe strain on 
our algorithm: since sharp transitions are degraded by sub- 
sampling and interpolation, the resulting image appears 
fuzzy unless the system transmits a very large amount of 

45 trim data. 

Again, the system cannot produce desirable quality while 
operating to the design objectives stated earlier. Since sub- 
stantial amounts of text are present in many images, failure 
of the invention to deal with text is a significant limitation. 

50 Two solutions are available for dealing with text. A 
preferred tactic is provision of a separate text data path for 
pure text printing — even in mixed text-and-image docu- 
ments. A second approach is to provide a relaxed threshold 
for small areas of text, such as captions and labels in 

55 otherwise pictorial images. 

Both these strategies require somehow recognizing that 
text is present. Such recognition can be accomplished in a 
variety of ways, as for example the text recognition methods 
in the above-mentioned patent document of Jae Noh. 

60 Thus our invention encompasses the preferred technique 
of operating a separate text-recognition module 18, that 
receives 17 and screens the original image 11, and can 
trigger operation of a discrete transmission path 19 for text. 
Being optional, this text path 17-19 is drawn in the dashed 

65 line. 

In essence the text information 17, 19 bypasses the 
subsampling block 13 and is transmitted directly without 
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compression — but with location information for reinsertion 
of the text into its proper position in the image. Here 
naturally some throughput is sacrificed to preserve text 
reproduction quality. 

5. AN EXAMPLE OF DATAFLOW IN TRANSMISSION 

Tracing a hypothetical small image block through the 
several process steps may be helpful in understanding the 
invention. Such an initial data array 11 appears in FIG. 3a, 
as data rows a through d, with columns a through f : thus the 
pixels of the first row are aa, ab, ac . . . af; and those of the 
second row are ba, be . . . bf; and so on to the bottom row 
with pixels db . . . df. 

As shown, each pixel has three associated eight -bit data 
values; these may be considered as corresponding to the 
three primary lights red, green and blue respectively. In other 
words each of the primaries can have any value from 2°-l 
through 2 8 -l, which is to say 0 through 255 inclusive. 

In pixel aa for example the red level is 253, the green 18 
and the blue 92. At the opposite corner of the array the last 
pixel df has red level 119, green 40 and blue 14. 

As can be seen the array has 4 rowsx6 columns-24 pixels. 
This array 11 is to be taken as corresponding to the array 11 
in FIG. 1, except that the values in this hypothetical array are 
not at all intended to be representative of any particular real 
image type; they are only arbitrarily chosen for purposes of 
illustrating how the processing works. 

One additional row e and column g are included beyond 
the borders of the array 11 proper, for the purpose of 
definiteness in showing interpolation for the data in row e 
and column f. (These additional data points are provided 
only because the array is very small and the example would 
be less helpful without ability to interpolate in the last row 
and column.) 

For n=2 the system retains data.in only every other-pixel, 
in both directions of the array — so the subsample 14 (FIG. 
3b) has just 2 rowsx3 columns=6 pixels, one quarter of the 
number (24) in the original image array 11. The pixels in the 
subsample — and the data for those pixels — are identically 
pixels aa, ac, ae, ca, cc, and ce. The exemplary subsample 14 
here is to be taken as corresponding to the subsample 14 in 
FIG. 1, except of course that they are derived from the 
arbitrary, hypothetical original image of FIG. 3a. 

Next, in anticipation of an interpolation in the printer as 
mentioned earlier, the FIG. 3b data are padded out by 
interpolation to produce the array 16 of FIG. 3c. The amount 
of data is thus returned to the same size as the original array 
11 of FIG. 3a. 

In this step the six data points of FIG. 3b are retained 
without change. For our present tutorial purposes they are 
therefore somewhat less interesting than the other points in 
FIG. 3c and therefore have been shown in relatively smaller 
type. 

The points that are new are all the other eighteen pixels in 
FIG. 3c. Those pixels in row a which are newly found by 
interpolation are ab', ad' and af ; and similarly the pixels in 
row d newly inserted by interpolation are db 1 , d' and df . 
Analogously certain pixels are added in the two intervening 
rows. 

Pixel ab' is typical of pixels that are between two pixels 
of the subsample 14 (FIG. 3b) — along a row or a column. 
Each pixel in this category is calculated simply as the 
average of those two original pixels. In the case of ab', it is 
found by averaging original pixels aa and ac. 

More specifically, each color value is found by averaging 
the corresponding two color numbers for the two horizon- 
tally or vertically adjacent pixels. That is, the red color 
number equals the average of the two red color numbers for 
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the two horizontally or vertically adjacent pixels; the green 
number (i. e., the number for green) likewise is equal to the 
average of the two original greeo numbers; similarly for 
blue. In the array 16, twelve of the eighteen new pixels are 

5 filled in by this procedure. 

Six of the pixels, however — those shown in boldface in 
FIG. 3c — are between original pixels only along diagonals. 
These must be calculated as the average of four values, 
namely the four that are at four corners relative to the pixel 

10 of interest. 

For example pixel bb' is found by averaging original 
pixels aa, ac, ca and cc (or alternatively from the four 
two -pixel interpolations at left, right, above and below). This 
process may be seen as the most intensive aspect of so-called 

15 "bilinear" interpolation, i. e. interpolation along two lines at 
the same time to generate only a single output data point. 
Here too, naturally, the values for the three color primary 
lights are handled independently — four red numbers being 
averaged to obtain the interpolated red number, four green to 

20 obtain the new green number, and four blue to find the new 
blue. 

As mentioned earlier, completion of row d and column f 
in FIG. 3c requires use of the auxiliary data provided at row 
e and column g in FIG. 3a. This is not called for along the 

25 left edge or top edge of the data block: the interpolation 
geometry is different. In actual practice, with much larger 
data blocks, edge effects are simply not very important and 
it is not necessary to be concerned at the asymmetry in 
interpolation geometry that may occur at the right edge and 

30 bottom edge of the block — or, alternatively, the asymmetry 
can be avoided by using an odd number of rows or columns, 
or both, in the data block. 
After the system has thus filled out this bilinear ly_ inter- 
- polated array 16, this array is compared with the original 

35 array 11 (FIG. 3a), by subtracting the former from the latter, 
to develop the residual data 24 (FIG. 3d). As can be seen, for 
example, in the upper-right-hand corner of the residual array 
24, the difference datum or residual for the red light is 
194-198—4; the residual for green is 91-94—3; and the 

40 figure for blue is 173-166-7. 

Necessarily the residual for the pixels in the subsample 14 
will be zero for all three color planes, as seen in FIG. 3d by 
the triplets of zeroes. These are printed in smaller type to 
emphasize their relationship to the original -pixel values — 

45 which are similarly printed in small type in FIG. 3c as 
mentioned above. The residual data 24 of FIG. 3d is to be 
understood as corresponding (again within the hypothetical 
character of the overall example) to residual 24 of FIG. 1. 
Next a threshold is applied to these data, to reduce the 

50 overall number of nonzero values in the interest of trans- 
mission brevity as previously explained. At the same time 
we take the opportunity to remove those of the zero values 
that are always zero by definition — namely, the above- 
mentioned six triplets of zeroes that are printed in smaller 

55 type in FIG. 3d. 

It would be pointless to transmit these eighteen values, 
since they would be redundant information and the receiver 
can simply be instructed that the values in these pixels are 
never modified. Accordingly these positions are all left 

60 blank in the thresholded data of FIG. 3e. 

With this step, the number of values to be accounted for 
in the RLE transmission process is reduced by twenty-five 
percent — from seventy-two to fifty-four (18 pixelsx3 num- 
bers per pixel=54 values in the block). In accordance with 

65 the most highly preferred form of our invention, the number 
of nonzero values is simply reduced to a fixed fraction of 
that total number (fifty-four) of values in the block. 
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In the system of greatest current interest, that fixed 
fraction is about five percent. This fraction of our hypotheti- 
cal data array of only fifty- four trim-data values comes to 
just under three numbers. In other words, about three 
numbers should be retained and all the rest discarded, i. e. 
reset to zero. (Of course in a real data block of say, for 
example, 150,000 pixels or more for a swath, the number of 
nonzero pixel values would be on the order of at least 5,500 
trim data points to be retained and transmitted.) 

It is important to optimization of our invention that the 
numbers retained be those particular numbers, preferably 
five percent or in our small example three numbers (for all 
three primary colors in the aggregate), whose absolute 
values are largest. We have set forth previously that the 
values of the residual data themselves serve as the selection 
criterion, for the points that are most important to keep and 
use — as well as serving in the role of the actual correction 
quantities to be applied later in the receiver. 

The results of this selection appear in FIG. 3e: the highest 
absolute values are a negative twenty-seven at pixel be", 
negative twenty-one at bf", and negative twenty at df 
respectively — all for blue. (It is only a coincidence that all 
three numbers are negative and in the blue plane, for the 
particular arbitrary data assumed.) 

FIG. 3e thus represents the results of an adaptive, or 
dynamic, thresholding process. It may then be asked what 
the threshold itself — i. e., the value above which error values 
are transmitted to the printer — actually is. 

A first answer to this is that the numerical threshold itself 
is not of particular interest, since the particular value of the 
threshold is applicable only to this particular numerical 
example — and the threshold for every other example will be 
at least somewhat different. 

Having said this, however, we can add that the threshold 
value is implicit in die numbers of FIGS. 3rf-and 3e; It is an 
absolute' value of nineteen (as seen for example at the 
unselected value for red at pixel df in FIG. 3d), since the 
smallest absolute value selected is twenty. 

More important is the adaptive threshold selection pro- 
cess. For the entire array, first the error having the highest 
absolute value is selected. 

If the number of occurrences of this error equals or 
exceeds the selected fraction (e. g., five percent) of the total 
number of values (fifty-four), then the selection is 
complete — except that if the number of occurrences exceeds 
the selected fraction by a significant number, then arbitrarily 
the system must pick some of the occurrences to discard, so 
as to eliminate the excess and thereby enforce the design 
objectives. 

If the number of occurrences selected does not equal or 
exceed the selected fraction, then the error having the next 
highest absolute value is selected The exception condition 
discussed in the preceding paragraph is then applied again — 
but with respect to the sum of occurrences of the highest two 
error values noted, rather than just the one highest. We 
iterate this process automatically until eventually the allot- 
ment is satisfied. 

Since the overriding objective is to make a modest 
improvement in overall quality while remaining within our 
time allotment — in other words, while maintaining a 3:1 
compression and avoiding time-consuming calculations — 
we are selecting the highest three values without regard to 
the color plane in which each value is found. As will be 
understood, reasoning could be presented in favor of instead 
allocating these values among the several color planes, as for 
instance in our example roughly one value for each color 
plane. 
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Such a strategy is not unreasonable, and is within the 
scope of our invention. Indeed it might be deemed 
preferable, on the basis that, e. g., making the selection 
across the board may neglect an error value in one color 
5 plane that is a large relative error (because the actual color 
number for that color in the pixel is small) — while instead 
selecting an error value in another color plane that is a small 
relative error (because the actual color number for that color 
is large). 

id This reasoning, while seductive, leads into troublesome 
areas: visual colorimetric effects are not necessarily propor- 
tional to relative errors, since the eye is much more sensitive 
to some colors than others; and the computations needed to 
determine which errors are really most important visually 

15 can be inordinately complex. Accordingly we prefer to rest 
on the overall rationale stated above — a modest improve- 
ment in quality, while maintaining simplicity and the target 
compression. 

(As mentioned earlier, the invention is usable not only 
20 with eight-bit RGB transmissions to a printer but also with 
other numbers of bits, and/or with CMYK transmissions etc. 
We prefer RGB transmission for a product of present interest 
because that product at the actual printing stage utilizes a 
relatively quite complicated CMYK structure, actually hav- 
25 ing plural colorants for certain of the primaries — so that in 
effect more than four primaries are present. Accordingly, 
RLE transmission of trim data for this form of CMYK 
printer-space variables would be considerably more compli- 
cated and time consuming. The tradeoff accepted here is that 
30 the conversion from RGB to the relatively complex CMYK 
system must be performed by the printer rather than the 
host.) 

It remains only to encode these data for transmission, and 
the results of that process appear as the RLE streanf28 of 

"35 "FIG. 3/ — once again, corresponding to the like-labeled 
stream 28 in FIG. 3a. The stream consists of data pairs: each 
entry in column A represents the number of occurrences of 
a particular trim -data value in an uninterrupted sequence or 
"run" (ignoring the blank pixels in FIG. 3e); and the adjacent 

40 entry in column B represents that particular trim-data value. 
The first two data pairs, rows A and B, account for all the 
pixels (all zeroes) in the red and green planes, plus the first 
eight variable pixels for blue. From FIG. 3e it can be seen 
that there are forty-three occurrences of zero in a row (again, 

45 disregarding the blanks) in this first category, and entries AA 
and AB in FIG. 3/ respectively read "43" and "0". 

Following in FIG. 3e is a single occurrence of negative 
twenty-seven, at pixel be". In FIG. 3/ this is reflected at 
entries BA and BB, reading "1 M and "-27". 

50 Next in FIG, 3e come a negative twenty-one, then eight 
variable-pixel zeroes, and finally a negative twenty — all for 
blue. These are represented by the remaining data in the last 
four rows of FIG. 3/, including "1" for "-21", "8" for "0", 
and "1" for "-20". 

55 The system recognizes the end of the RLE data for a 
particular color plane by the point at which the sum of all the 
column-A values cumulates to the number of pixels in the 
array — i. e., eighteen for each plane. It is possible, however, 
for a run to bridge two color planes as in the case of the 

60 forty-three zeroes mentioned above and shown in row D of 
FIG. 3/. The receiving system is trained to distribute these 
appropriately into the color planes involved, as well as 
appropriately managing the nonvariable pixels. 
The example as presented here does not represent realis- 
es tically the relative amounts of data to be transmitted in the 
RLE stream 28 and subsample 14. As can be seen in FIG. 3/, 
there are five data pairs (ten pieces of data). 
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For a fairer comparison with the amount of data in the 
original array 11 or the subsample 14, these five data pairs 
might be regarded as therefore the equivalent of about three 
data triplets. From this analysis of the small-size example it 
accordingly might be thought that the RLE data stream is 5 
comparable (about half) in bulk and in transmission time to 
the six-triplet subsample 14. 

Actually, however, the RLE stream itself can be advan- 
tageously transmitted in special formats to exploit, for 
example, the facts that the number of bits needed to identify 1Q 
a "zero" is small, and "zero" is the most prevalent entry in 
the tabulation. Moreover a full-size sample tends to develop 
a far lower RLE overhead statistically. In practice the RLE 
stream 28 requires well under one third the transmission 
time of the subsample 14 — sometimes even staying at five 
percent. 15 

Much more realistic data appear in the histographic 
presentations of FIGS. 4a through 4c. These will be self 
explanatory to people skilled in this field. 

6. A CORRESPONDING EXAMPLE OF DATA FLOW IN 
RECEPTION 20 

At the printer the subsample 14 (FIG. 3b) is received with 
the RLE stream 28 (FIG. 3^. The subsample is interpolated 
31 (FIG. 2) to produce an interpolated array 32 just like the 
array 16 (FIG. 3c). 

The RLE stream 28 is decoded 33 to reconstitute a 25 
trim-data array 34 just like the outgoing trim-data array 26 
(FIG. 3e). Now this trim-data array is applied 35 to the 
interpolated array, yielding a refined approximation 36 to the 
original data array 11. (It is at this point that the printer skips 
over the pixels which are in the subsample, so that the 30 
blanking of those pixels as in FIG. 3e does not throw the 
reconstruction process out of synchronization.) 

This improved approximation 36 appears in FIG. 3g, 
which represents simply addition of the arrays 16 and 26 of 
FIGS. 3c and 3e. It can be noted that at the pixels where 35 
nonzero -entries appear in FIG. 3e — namely the pixels at 
which the interpolated array 32 was most seriously out of 
agreement with the original array 11 — the refined, final 
approximation 36 is now identically in agreement with that 
original. 40 

These particular values appear in boldface type in FIG. 
3g. They exhibit restoration of the original data at all the 
pixels where nonzero entries appear in FIG. 3e — namely the 
values of "80" and "78" in pixels be' and bf respectively, 
and "14" in pixel df— all for blue. 45 

All of the remaining errors that were disregarded, by 
application of the threshold to the residual 24 in FIG. 3d, are 
smaller — and in the great majority of cases much smaller — 
than those which were corrected. 

7. CONCLUSION 50 
Our system and method, even though constrained to 

maintain the target compression ratio, turns out in practice 
for photograph-like images to satisfy the previously men- 
tioned condition of being "visually lossless". While we do 
discard some data in the trim-data forming means, the 55 
selection of data to be discarded is just sufficiently cautious 
that the difference is not easily seen. 

The above disclosure is intended as merely exemplary, 
and not to limit the scope of the invention — which is to be 
determined by reference to the appended claims. eo 

What is claimed is: 

1. In a system for transferring pixel-based image data 
from an image-data source into a printer, and for using said 
data in the printer, the combination of: 

means for subsampling an entire set of image data, to ss 
form a subsample for transmission and later interpola- 
tion; 
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means for evaluating error that will remain upon interpo- 
lating the subsample; 

means, responsive to the error-evaluating means, for 
forming an amount of trim data for application to the 
interpolated subsample to remove part of said error; 

wherein the trim-data forming means control the amount 
of trim data to maintain an advantage in overall trans- 
mission time of the subsample and the trim data, 
relative to transmitting the entire image-data set; and 

means for transmitting, to the printer, the subsample and 
the trim data; 

wherein the trim-data forming means further comprise 
means for padding out said controlled amount of trim 
data with zeroes to form a complete trim-data array; 
and 

further comprising means for run-length encoding the 
padded-out trim-data array. 

2. In a system for transferring pixel-based image data 
from an image-data source into a printer, and for using said 
data in the printer, the combination of: 

means for subsampling an entire set of image data, to 
form a subsample for transmission and later interpola- 
tions; 

means for evaluating error that will remain upon interpo- 
lating the subsample; 

means, responsive to the error- evaluating means, for 
forming an amount of trim data for application to the 
interpolated subsample to remove part of said error; 
and 

means for transmitting, to the printer, the subsample and 
the trim data; and wherein: 

the trim -data forming means control the_ amount of trim 
data to maintain an advantage in overall transmission 
time of the subsample and the trim data, relative to 
transmitting the entire image-data set; 

the trim-data forming means control the amount of trim 
data for a data block, at a substantially constant 
fraction of the number of points in said data block, 
substantially independent of the error-evaluating 
means; and 

said data block is either the entire image-data set or one 
of a group of defined portions thereof. 

3. The combination of claim 2, particularly for use with a 
printer that constructs an image on a printing medium by 
scanning of a multielement printhead across the medium to 
form swaths of dots; and wherein: 

each of said portions is a swath of dots; and 

the trim -data forming means control the amount of trim 

data for each swath at a substantially constant fraction 

of the number of points in the swath. 

4. The combination of claim 3, wherein: 
said fraction is roughly five percent. 

5. The combination of claim 2, further comprising: 
means for transmitting pure-text portions of an image 

separately from said subsample and said trim data. 

6. The combination of claim 2, further comprising: 
means for storage of automated instructions for operation 

of all of the foregoing means; and 
means for retrieval of the instructions from storage and 
application of the retrieved instructions to operate all of 
said foregoing means. 

7. In a system as described in claim 2, the combination of 
claim 1 further comprising in the printer: 

means for receiving the subsample; 
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means for interpolating the subsample, to form an 

approximation of the entire set of image data; 
means for receiving the trim data: 
means for applying the trim data to the interpolated 

subsample to remove part of said error; and 5 
means for applying the trimmed interpolated subsample to 

control printing of a corresponding image by the 

printer. 

8. The combination of claim 7, further comprising in the 
printer: 

means for storage of automated instructions for operation 
of all of the foregoing receiving, interpolating and 
applying means; and 

means for retrieval of the instructions from storage and 15 
application of the retrieved instructions to operate all of 
said foregoing receiving, interpolating and applying 
means. 

9. The combination of claim 8, wherein; 

the interpolating means interpolate bilinearly, 20 

10. A system for receiving and using pixel-based image 
data from an image-data source, in a printer; said system 
comprising: 

means for receiving a subsample of an entire set of image 
data; 

means for interpolating the subsample, to form an 
approximation of the entire set of image data; 

means for receiving trim data that comprise a controlled 
amount of trim data, padded out with zeroes to form a 30 
complete trim-data array, and run-length encoded; 

means for run-length decoding the padded-out trim data; 

means for applying the trim data to the interpolated 
subsample to remove a significant part of said error; 
and ------ 35 

means forapplying the trimmed interpolated subsample to 
control printing of a corresponding image by the 
printer. 

11. The system of claim 10, further comprising: 
means for storage of automated instructions for operation 

of all of said means; and 
means for retrieval of the instructions from storage and 
application of the retrieved instructions to operate all of 
said means. 
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12. The system of claim 10, wherein: 

the interpolating means interpolate bilinearly. 

13. A method of transferring image data into a printer, for 
printing of a corresponding image on a printing medium by 
construction from individual marks formed in pixel column- 
and-row arrays by a scanning multiple -nozzle pen that 
operates in conjunction with a printing-medium advance 
mechanism; said method comprising the steps of: 

subsampling an entire set of the image data, to form a 
subsample for transmission and interpolation; 

evaluating error that will remain upon interpolating the 
subsample; 

responsive to the error-evaluating means, forming trim 
data for application to the interpolated subsample to 
remove a significant part of said error; 

transmitting the subsample and the trim data from the 
source to the printer; 

storing, in machine-readable form, automated instructions 
for all the foregoing steps; and 

retrieving and implementing the automated instructions to 
compress and transmit data; 

wherein the subsampling step controls the amount of trim 
data for a data block, at a substantially constant fraction 
of the number of points in said data block, substantially 
independent of the error-evaluating step; said data 
block being defined as either the entire image-data set 
or one of a group of defined portions thereof. 

14. The method of claim 13, wherein: 

the trim-data forming step further comprises controlling 
the amount of trim data to maintain an advantage in 
overall transmission time of the subsample and the trim 
data, relative to transmitting the entire image-data" set." 

- " 15. The method of claim 14, further comprising, in the 

printer: 

receiving the subsample and the trim data; 
interpolating the subsample; 

applying the trim data to refine the interpolated sub- 
sample; and 

applying the refined interpolated subsample to control 
printing of the corresponding image. 
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